DOCUMENT BE50HE 

ED 085 084 PS 006 878 



AUTHOfi 
TITLE 



INSTITUTION 



SPONS AGENCY 

REPORT NO 
BUREAU NO 
PUB DATE 
GRANT 
NOTE 

EDBS PRICE 
DESCRIPTORS 



IDENTIFIERS 



(DHEW/OE) , 
1 



Feldinaiir David H. 

The Fixed-Sequence Hypothesis: Individual Differences 
in the Development of School Related Spatial 
Reasoning. 

tlinnesota Univ., Minneapolis. Researchf Development, 
and Demonstration Center in Education of Handicapped 
Children. 

Bureau of Education for the Handicapped 
Washington, D.C. 
RR-1 
332189 
Mar 70 

OEG-09-332 189-4533 (03 2) 
47p. 

HF-$0.65 HC-$3.29 

Age Differences; *Cognitive Development; *Ccncept 
Formation; Cultural Differences; *Elementary School 
Students; ^Individual Differences; *Junior High 
School Students 

♦Cognitive Developmental Theory ; Invariant Stages ; 
Spatial Reasoning 



ABSTRACT 

In order to gather data bearing on the effects of 
individual differences of the "invariant stages" assumption of 
cognitive-developmental theory, 270 black, Chinese, and white 
subjects from 5th, 7th, and ninth grades were administered a new, 
25-item, spatial reasoning task. It was predicted that age, ethnic 
group, and sex would significantly influence mean levels of 
achievement, but that analyses of patterns of item performance would 
reveal a similar sequence of concept and skill acquisition regardless 
of ethnic group means. These predictions were called the 
"fixed-sequence hypothesis" • Age and ethnic group were significant 
influences on performance; sex was not. Scalogram analyses showed 
that each ethnic group's performance tended to form a scalable item 
set, but for a somewhat different ordering of the 25 items. Thus, the 
results were inconsistent with cognitive-developmental theory. It was 
condluded that individual differences are likely to affect 
development when sequences are highly task specific, but that the 
bulk of the evidence still supports a general sequence of stages in 
cognitive development. Implications for developmental theory, 
intelligence testing, and curriculum planning are discussed. 
(Author/DP) 
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Abstract 

In order to gather data beardng on the effects of individual 
differences on the 'invariant stages' assumption of cognitive- 
developmental theory, 270 5th 7th and 9th-grade Black, Chinese, 
and White subjects, 90 at each grade level and of similar SES, were 
administered a new 25 item spatial reasoning (map-reading) task. 
It was predicted that age, ethnic group, and sex significantly 
influence mean levels of achievement, but that analyses of patterns 
of item performance would reveal a similar sequence of concept and 
skill acquisition regardless of ethnic group means. These predic- 
tions were called 'the fixed— sequence hypothesis.' Age and ethnic 
group were significant influences on performance; sex was not. 
Scalogram analyses showed that each ethnic group's performance 
tended to form a scalable item set, but for a somewhat different 
ordering of the 25 items. Thus, the results were inconsibtent 
with cognitive-developmental theory. It was concluded that indivi- 
dual differences are likely to affect development when sequences 
are highly task specific, but that the bulk of the evidence still 
supports a general sequence of stages in cognitive development. 
Implications for devalopmental theory, intelligence testing, and 
curriculum planning are discussed. 



THE FIXED-SEQUENCE HYPOTHESIS: INDIVIDUAL DIFFERENCES 



IN THE DEVELOPMENT OF SCHOOL RELATED SPATIAL REASONING 

David H. Feldman 
University of Minnesota 

T he Problem 

Kohlberg (1968) wrote that an invariant sequence of stages 
of cognitive development is essential to the 'cognitive-develop- 
mental' point of view* In calling for empirical data bearing on 
the question of invariant stages, Kohlberg commented that: 

•..it is extremely important to test whether a 
set of theoretical stages does meet the empirical 
criteria,.. If empirical sequence was not found, one 
would argue that the 'stages' simply constituted 
alternative types of organization of varying com- 
plexity, each of which might develop independiently 
of the other (p* 1022), 

Inhelder (1968) , in a revision of her 1943 work on reasoning 

In the mentally retarded, also emphasized the importance of using 

objective statistical methods for assessing stages in reasoning 

development. Inhelder noted that results in the Genevan laboratory 

tend to indicate that rates of development differ among subjects, 

but sequences of cognitive devielopment are invariant: 

In order to use statistics to determine whether the 
developmental succession of certain behaviors does 
follow such a hierarchical process, or whether it is 
simply a question of fortuitous temporal succession, 
it Is necessary to resort to procedures of hierarchical 
analysis such as Guttman introduced into social psychology. 
We owe a debt to L.J. Cronbach for having suggested that 
we use ordinal methods as far back as 1954. Vinh Bang... 
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found that the indices of reproducibility for the solu- 
tions furnished for various tests are not the same for 
all ages. This is of great interest .because it pro- 
motes understanding of the dynamics inherent in the 
normal evolutionary process, in that it implies that the 
diverse types o£ operational behavior do not always 
follow the same speed of development... (pp. 319-320). 

Thus, two key assumptions of the theory, as expressed by Inhelder, 
are that development rates may differ ^imong subjects, but develop- 
mental sequences do not. 

Wallach (1963), in his review of research on children's 
thinking, wrote; "Concerning the effects of sociocultural factors, 
variation in absolute age norms has been found but not in over-all 
developmental sequences (p. 257)." A critical review of the recent 
evidence bearing on the invariant sequence assumption reveals that 
this conclusion is only partially supported by the data. While 
a general sequence of stages in achievement of cognitive levels 
does seem to hold across tasks and samples, several specific 
discrepancies between theoretical requirements and empirical 
evidence have been found. 

Dodwell (1960, 1968) and Wohiwlll (1960), in their studies 
of the development of number concepts ^ found general support for 
a fixed sequence of acquisition, but noted that several tasks 
were mastered earlier or later than predicted by Piaget. Dodwell 
(1968) concluded that "on the basis of the results here reported... 
the pattern of development is neither as regular, nor as simple. 



as Plaget has suggested*' (p. 112). *Kofsky (1966), In her study 
of classlflcatory development, found that individuals vary in the 
steps by which they master cognitive task, and that the evidence 
for an invariant sequence supports *'at best... a partial order'' 
(p. 200). Uzgiris (1964) studied conservation of size, weight, 
and volume using several different materials (e.g., metal cubes, 
plastic balls, etc.) and found that individuals varied in the 
extent to which they conserved with different sets of materials - 

Goldman (1965), investigating the understanding of religious 
concepts, got general support for Piaget's stages, but found that 
the order of task difficulty was not always a«? Piaget's theory 
requires. Similar results were reported by Larsen (1969) for spatial 
concepts. To produce a fixed sequence (in operational terms, to 
produce a scalable set of tasks or items), several researchers 
have reduced the number of steps in a sequence by combining or 
dropping tasks and/or changing the criteria for passing or failing 
Items (e.g.. Peel, 1959; Larsen, 1969; Siegelman & Block, 1969). 

Thus, it appears that the more specifically one attempts to 
operationalize a sequence of steps in development, the more likely 
one is to find discrepancies between theory and evidence. One 
possible explanation for these discrepancies is that experiences 
such as family practices and educational history have little effect 
on general sequences of development, but may influence more 
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specific steps or processes through *which a general stage of 
cognitive functioning is achieved, Uzgiris' (1964) results for 
conservation using different materials may support this interpreta- 
tion; Uzgiris found individual differences regarding the materials 
used in conservation, although the sequence from nonconservation 
to conservation appeared to be invariant • 

Very little data bearing on the possible effects of individual 
or group differences on sequences of development has been collected, 
Bruner, Olver and Greenfield (1966) reported several studies com- 
paring Wolof, Mexican, Eskimo, and American children on several 
Piagetian tasks. In general, Bruner eit al found that schooling 
vs. lack of schooling and urban vs, rural environments were often 
more significant than cultural differences in affecting cognitive 
development levels; sequences of development were not systematically 
studied, however. Bruner ejt ajL suggested that there may be 
different modes of achieviii^g the same cognitive goals; for 
example, Wolof and American children both achieved conservation, 
but they appeared to achieve it through different reasoning pro- 
cesses. 

Studies by Kohlberg (1963) with Atayal children and Stodolsky 
(1965) with Black children applied sequential scaling techniques 
to dream concepts (Kohlberg) and classification tasks (Stodolsky). 
Kohlberg reported comparable scalability for the development of 
dream concepts in Atayal and White children; Stodolsky reported 
comparable Guttman scaling patterns in classification skills for 

O 
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Black and White lower-class children in Chicago. Both studies 
confirmed previously reported differences in levels of performance 
but invariant sequences. 

Turiel (1969) summarized several studies of moral develop- 
ment in groups of subjects varying in social class (Kohlberg, 1958), 
urban-rural environments (Owen, 1968), and cultaral background 
(Kohlberg, 1966; Kramer, 1968). With few exceptions, the data 
appear to support the invariant stage sequence assumption of 
moral development. Turiel' s own work is especially noteworthy for 
its substantive findings and for its novel methodology, which 
sheds light on the processes of transition between stages as 
well as the sequentiality o± stages. 

The present study initiates a program of research aimed at 
further testing of the possible effects of group and individual 
differences on sequences of cognitive development. Because 
previous studies have found specific discrepancies between theory 
and evidence as a function of the level of specificity with which 
sequences are defined, the present study attempted to further test 
the invariant sequence assumption against a highly articulated 
sequence. An additional aim of the study was to base the hypothesized 
sequence on both theoretical grounds and a concep,txial analysis of 
the criterion concepts to be achieved, as suggested by Kohlib:erg 
(1968). A test instrument, validated in a pilot study (Feldman, 



1969), was designed to assess a hypothesized specific sequence of 
skills and concepts requisite to performance of a complex reasoning 
task* Ordinal scaling techniques were used to analyze developmental 
sequences (Guttman, 1947; Edwards, 1957). 

The instrument used in the study was intended to assess the 
influence of ethnic background on the acquisition of specific 
concepts and skills requisite to the proper drawing of a geographic 
map. The items for the instrument were derived from Salomon's 
(1968) task analyses of the geographic map, Piaget and Inhelder 
(1967), among others, have shown that map drawing is a useful task 
in- studying the development of spatial reasoning. The modern map, 
a unique product of Western culture and an example of a scientific 
"model, has also been shown to be difficult for non-Western children 
to understand (Salomon, 1968; Dart & Lai Pradham, 19.67). Finally, 
Lesser and his colleagues (Lesser, Fifer, & Clark, 1965; Stodolsky 
& Lesser, 1967) have found ethnic differences in spatial ability 
even when social class was controlled. Since social class has 
often been a confounding variable in studies which compare the 
performance of different ethnic groups (Stodolsky St Lesser, 1967; 
Hess & Shipman, 1965; Eisner, 1967)3 the present study selected 
samples of equivalent socio-economic level, but differing in ethnicity 
Hypotheses 

"The fixed-sequence hypothesis" is comprised of two related propo 
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sltlons, each of which was tested separately. First, it was hypothe- 
sized that with social class held constant there are differences in 
the level of achievement of the. three ethnic groups tested. Second, it 
was hypothesized that despite differences in the level of achievement 
exhibited by the ethnic groups, sequences of acquisition of concepts 
and skills would not significantly differ across ethnic group. 

In addition to the two parts of the fixed-sequence hypothesis, 
four secondary hypotheses bearing on the validity of the experimental 
task were tested. These hypotheses are presented in detail in the 
results section. 

A key assumption of the study was that developmental sequences 
of concept and skill acquisition are reflected in patterns of item 
performance, and that rates of development are reflected in mean scores 
for each ethnic group. Wohlwill (1960) and Kofsky (1966) have discussed 
some of the problems with this assumption, several of whic^i will be 
taken up in the discussion of the results of this study. 

Method 

Subjects a id. Sampling Techniques 

Subjects were 270 fifth, seventh, and ninth-grade students 
attending public schools in San Francisco, California:. The diffi^ 
culty in selecting schools with children of comparable social class 
but varying ethnic background was considerable, since social class 
and ethnicity tend to covary. Another difficulty, pointed out by 
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Deutsch (1967), is that the same dafa may mean different things 

for different ethnic groups. An income of $6000 for a Black family 

may moan something quite different than the same income for a 

Chinese family. These difficulties notwithstanding, income and education 

level of parents were taken as indicators of social class (Hess, 1968) • 

Table 1 presents the data for education and income of the populations 

of the schools selected for inclusion in the study. 

Schools were chosen with relatively homogeneous ethnic popu-- 
lations at the elementary level; the junior high school chosen 
had about the bame number of students from each of the three ethnic 
groups under investigation. Table 2 presents the ethnic mix of 
each of the schools selected. 

The neighborhoods were described as lower-middle class by 
several informants; direct observation tended to support this 
description. It was recently estimated by the local Chamber of 
Commerce that a $10,000 income is required for a family of Pour 
to live comfortably in San Francisco; the median income of the 
families in the study was $6900. 

Two additional Informal sampling procedures were used in 
the present study. School principals were advised to exclude 
children from the testing sample who were from 'upper-middle 
class' or from 'severely disadvantaged' homes. They were also 
told to select students from all ability levels and behavior 
patterns, including 'retarded' or 'disruptive^ students. Since 
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TABLE 1 

Median Income and Years of Education of Families 





Median Income 


Median 


Education 


Sample 


Population 


Elementary School B 


$6300.00 


12.1 


Years ' 


30 


459 


Elementary School C 


$7000.00 


12.4 


Years 


30 


684 


Elementary School W 


$7000.00 


11.9 


Years 


30 


303 


Junior High School 


$6700.00 


12.2 


Years 


180 


997 


Total 


$6900.00. 


12.2 


Years 


270 


2443 
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TABLE 2 

Ethnic Mix of the Schools Selected for Study 







%Black 


^Chinese 


Other Groups Tested 



Elementary School B 


14.8 


74.1 


4.1 


7.0 


Black 


Elementary School C 


4.0 


0.4 


94.3 


1.3 


Chinese 


Elementary School W 


71.3 


2.0 


1.3 


25.4 


White 


Junior High School 


40.7 


28.6 


17.2 


13.5 


All (B,C.\0 
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students were tracked in the junior high school, approximately 
equal numbers of students were selected from each track. 

Although not a random sample from a well-defined population, 
the subjects probably represent a large segment of the population of 
many American urban areas. However, the study is not generalizable 
to disadvantaged populations, since part of the current meaning of 
the term disadvantaged includes poverty. 

As stated earlier, the present study, attempted to control 
for social class while examining variation due to ethnic group. 
The lower-middle class sample finally selected proved to be the only 
social-class group on which it was possible to match the three 
ethnic groups selected for investigation. 
Instruments and Scoring 

The test instrument (map test) had two parts. A map-drawing 
task, similar to one used by Dart and Lai Pradham (1967) in Nepal 
and Hawaii, asked subjects to draw a map of the school and the school 
grounds as seen from above. The instrument was also similar to 
Eisner's (1967) task, in which subjects drew a picture of the school 
grounds and their friends, and to a Piagetian diagram-drawing task 
(Piaget & Inhelder, 1967). The map--drawing task of the present 
study put greater conceptual demands on the subject than previous 
drawing tasks used to assess cognitive development; Goodenough 
(1924) and Harris (1963) asked subjects to "draw-a-man, '*draw-a-woman , 
or "draw-a-tree," while Piaget and Inhelder (1967) and Salom.on (1968) 
provided a visual model which the subjects werer free to use in 
constructing their diagrams. Eisner (1967) asked children to 
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draw a picture of the school grounds', which required memory and 
visualization but did not demand abstract representation. Since 
performance on this map--drawing task required subjects to draw a map 
■from above with no memory support, it was expected that perfor- 
mance might fall somewhat below that found in previous studies. 
The purpose of the map-drawing task was to provide a criterion 
against which to validate a 25-item map-reading test (described 
below) • 

A scoring system was developed for the map-drawing task 
which took much of its content from Piaget and Inhelder (1967) 
and its scoring procedures from Eisner (1967). Based on Piagetian 
stages of spatial reasoning in diagrammatic layout construction, 
a six-category rating scale was developed. Each subject's map was 
rated independently by two trained judges (both of whom were graduat 
level psychology students). Judges were trained with written 
descriptions of the categories and with example maps produced 
by pilot study children (Feldman, 1969). Initial agreement in 
ratings between the judges was 50*60%. After four training sessions 
inter-* judge agreement reached a criterion of above 80%; the judges 
disagreed in 2% of the cases by more than one category. Following 
independent rating, the judges were broughi: together to discuss 
disparate ratings. Agreement on 97% of the drawings was achieved 
in this manner, comparable to results reported by Eisner (1967). 
The remaining 3% of the drawings were arbitrarily categorized by'^ 
the author. 
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The second part of the experimental task was a map-reading 
test. On the basis of results from a pilot study (Table 3, 
Feldman, 1968), a set of 25 map-reading tasks was arranged in a 
hypothetical fixed sequence of acquisition. The formal character- 
istics of the fixed sequence are equivalent to Gagne'' ' s (1962) 
"hierarchical learning sets," and were stated by Gagne'' as 
follows: 

1. If a higher-level learning set (task) is 
passed (+) , all related lower-level tasks must 
have been passed. 

2. If one or more lower- level tasks have been 
failed (-), the related higher- level tasks must 
be failed. 

3. If a higher-level task is passed (+) , no 
related lower-level tasks must have been 
failed (-). 

4. If a higher-level task has been failed (-), 
related lower-level tasks must have been passed. 

In his studies of learning sets, Gagne'' was not attempting 
to infer developmental sequences from his data. It is the distinc- 
tion between typical learning and typical developmental sequences 
that separates the present research from Gagne^'s work. The actual 
arrangement of the items in the test instrument was intended to 
form a 25-stage set of prerequisites to the criterion, i.e., to 
proper drawing of an abstract, formal map. 

Following Salomon (1968), the present map reading test used a 
fictional island as its visual stimulus. Subjects answered questions 



TABLE 3 



Concepts and Skills Assessed by Items in the Map Test 
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Concept or Skill 



Item Number (s) 



I. Identification of: 
Stimulus 
Ikons 

Signs and Symbols 



2, 3. 4 
5, 6, 7 



II. Analysis of: 



Distance 
Map Heights 
Filled vs. Empty Space 
Map Directions 
Personal and Ma? Heights 
Personal Heights 
Personal Directions 
Comparative Directions 
Knov;ledge of Latitude- 
Map Directions with Logic Problem 
Personal Directions v;ith Logic Problem 
Symbol Unders tanding-*Abstract 



8 
9 

10 

11, 12 

13 

14 

15 

16 

17 

18 

19 

20 



III. Interpretation based on: 

Inferences about Weather 
Choice of Capital with No Visual Symbol 
Two Smallest Towns 
Location for Harbor 
Complex Hypothesis; Fishing Village 
vs. Farming Tovm ' 



21 
22 
23 
24 

25 
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based on information in the visual stimulus, a copy of which was given 
each subject, and in materials provided in a prepackaged booklet. 
Each of the 25 tasks in the test was designed to reflect subject's 
level of acquisition of a concept or skill hypothesized to be requisite 
to proper performance in map drawing. The content of the items was 
based on Salomon's conceptual task analysis of the map and on Piaget 
and Inhelder's (1967) research on spatial reasoning (see Table 3), 
Of the 25 items, 17 were multiple-choice questions with four 
alternative answers and a blank space if subject wished to write 
his own ans^vTer, Each dis tractor in the multiple choice items was 
designed to reflect one of four different reasoning levels. Designing 
distractors to give information other than 'correctness' was suggested 
in a paper by Guttman and Schlesinger (1967) and is discussed in 
greater detail elsewhere (Feldman & Markwalder, 1970). This 
procedure allows for the differential scoring of subjects on the basis 
of the types of wrong answers to which they are attracted. Questions 
were designed so that each one of the distractors was intended to 
reflect either (a) tautological or imaginary reasoning (preconceptual 
thought in Piaget 's terms); (b) perceptual or associative reasoning 
(beginning of concrete operations); (c) concrete reasoning (logical 
but limited to information in the stimulus); or (d) foimal reasoning 
(not necessarily based on information directly obtainable from the 
stimulus). The four reasoning levels were extrapolated from Piaget 's 
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theory of cognitive development (Piaget, 1950, 1952; Flavell, 1963; 
Hunt, 1961; Sullivan, 1967) < 

Three sample items of increasing difficulty are presented below: 

^il What does the little drawing near the town of Koff tell you? 

a. It tells me something nice about the people who live 
in Koff (Imaginary •) 

b. It tells me there is a cow near Koff (Perceptual*) 

c. It tells me there is a dairy near Koff (Concrete.) 
d* It tells me that Koff is a place where cows are 

raised for milk and butter (Formal.) 



#8 How would you find out how far it is from one side of the 
map to the other? 

a. I would ask a man who lives on the island (Imaginary.) 

b* I would drive a car and look at the miles (Concrete.) 

c< I would have to see if the roads are good (Perceptual.) 
d. I would use a scale on the map. (Formal.) 



#21 Circle the words which tell you what you think the weather 
Is like on this island most of the time. List the reasons 
why you think the weather is what you say. 

Rainy (Formal); Icy (imaginary); Cold (Perceptual); Hot (Forma 

Foreign (Imaginary ); Sticky (Formal); Windy (Concrete-Percep- 
tual); Dry (Imaginary or Perceptual); 

Foggy (Concrete-Perceptual); Icky (Imaginary); Cool (Concrete- 
Perceptual); Snows (Perceptual or Imaginary); 

Humid (Formal); Pretty (Perceptual); Dusty [Perceptual); 
Smoggy (Concrete-Perceptual) ; Nice (Perceptual) . 

Map-reading test items were first scored for the number * correct' 

i.e., in a manner similar to a standard multiple-choice test. This 

score was referred to as the nap-reading test score (MS). MS was used 

to compare group achievement levels; items answered formally, and in 

some cases concretely, were considered as passes in the fixed-sequence 

analysis. 
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A second score was obtained by (Computing a mean reasoning level 
(RL) exhibited on the map-reading test, as follows: an imaginary/ 
tautological answer was scored 1; a perceptual/associative answer, 
2; a concrete answer, 3; a formal answer, 4. All 25 items were 
evaluated in this manner and a reasoning level score was computed by 
the formula: 

(Imaginary responses x 1) + (Perceptual x 2) 
^ + (Concrete x 3) + (Formal x 4) 

25 (Number of Items) 
Reasoning level and map-reading test score were related since in 
general the RL increases as the number of items answered correctly 
increases. However, two subjects with identical map-reading test 
scores could exhibit markedly different reasoning levels, depending 
on the level of reasoning exhibited in the answers they missed, as 
in the following hypothetical example: 

51 Total Score = 10 
Formal: 10 x 4 = 40 
Concrete; 15 x 3 =^ 45 

Reasoning Level = 90/25 =3.6 (Formal) 

52 Total Score = 10 
Formal: 10 x 4 = 40 
Imaginary; 15 x 1 = 15 

Reasoning Level = 55/25 =2.2 (Perceptual) 
RL was used to test the strength of the relationship between map- 
drawing categories and Piagetian stages of general cognitive development 
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Procedures for Administration 

A primary purpose of the testing procedure was to reduce, insofar 
as possible within practical constraints, the dependence of a child's 
performance on his ability to read. Another purpose was to reduce the 
anxiety of test-taking to a minimum so that each child had the best 
possible opportunity to exhibit his reasoning about spatial concepts. 
The examiner (E) read all directions, each question and its distractors 
aloud, proceeding as slowly as was necessary to insure each child the 
opportunity to think about and complete the items. Second, each 
examiner was of the same ethnic background as the children with whom 
she worked. Examiners of the same ethnic background as subjects were 
used by Lesser, Fifer, and Clark (1965) and Stodolsky and Lesser (1967) 
for similar purposes. 

Subjects were tested in groups of 10 to 15 in library, cafeteria, 
auditorium, or classroom facilities. Procedures for getting the children 
out of class and into testing rooms varied from school to school and 
may have contributed to uncontrolled variations in the testing 
situation. 

As the children entered the testing room, the following instructions 

were given by Examiner: 

Good morning (afternoon). My name is Mrs. ^ . 

This morning (afternoon) you and I are going to do a map 
exercise together. In this exercise, we will be interested 
in how you draw and how you think about maps. Now I will 
give each of you a booklet which has some blanks for you 
to fill in on the front. (Examiner distributed map tests.) 
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Please fill in each of the blanks; circle if you are a boy 
or a girl. Now in the blank where it says *^age" write 
your birth dat e, both the month and the year you were born. 

Now we will read the instructions, you to yourself and me 
out loud. (Instructions were read, making sure everybody 
understood that the drawing was to be of the school and the 
school grounds as seen from above.) 

On the next page, which is a blank page, draw a map of 
the school and the school grounds as seen from above. 
You will have ten minutes to draw your map. Then wait 
for your next set of instructions. Good Luckl 

O.K., finish up your drawings and we will go on to the 
rest of the exercise. Please don't turn the page until I 
ask you to do so. Now I am going to give each of you one 
of these (holds up stimulus map). For the rest of the work 
we will use these. Does everyone have one? (Examiner kept 
a map for herself to point out what a question was referring 
to. For example, when Examiner read the question having 
to do with the green color, she pointed to the green parts 
of the map.) 

VJhat we're going to do is not a test; it is an exercise 
to find out what you think are the best answers to some 
questions about maps. There will be no grade given; this 
is because we want to know your answers. You will have 
as much time as you need to do each question. If I go 
too fast, raise your hand, and we will wait. Remember, 
we are not interested in how fast you can go. I will 
read each question out loud as you read it to yourself. 
If you have trouble understanding the question the first 
time I read it, raise your hand, and I will repeat the 
question. Does everyone understand? 

Is everyone ready? O.K., let^s flip over the page and 
look at question #1. Notice that there are four possible 
answers and a space for you to write in your answer if 
do not find one which ;you think is best. (Repeated for 
each question. ) 

Testing was done in the morning and early afternoon hours. All 
groups were tested within ten days of each other (October 7 through 
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17). Testing time varied from about one hour with fifth graders to 
about 45 minutes with ninth-grade children. Examiners reported that 
in general the children at all three grade levels appeared well 
motivated to complete the task and did not find it too easy or too 
difficult* However, in a small number of cases, children were not 
given adequate time to finish their work, contributing further uncon- 
trolled variance to the results. 

Results and Discussion 

Map Test Validity Hypotheses . Results bearing on three secondary 
hypotheses of the study were intended to assess the validity of the 
map test tasks. These results are presented first because confidence 
in interpreting the fixed-sequence analysis depends upon adequate 
validation of the test instrument. 

Secondary hypothesis 1 predicted a significant, positive corre- 
lation between map-reading scores (MS) and map-drawing categories (MC) . 
If the map-reading test assesses the skills and concepts requisite 
to proper map-drawing behavior, a significant agreement between the 
two measures was expected* The correlation between MC and MS was 
found to be ,46 (p < .01) for the entire sample; subgroup correlations 
did not significantly differ from the total sample* Although in 
support of the hypothesis, the agreement between the two instruments 
accounted for only 21% of the variation in performance. Restricted 
variability in MC (six categories) may have affected the MC x MS 
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correlation, but on the basis of the present results, only modest 
support for the hypothesis was provided. 

Secondary hypothesis 2 predicted significant increases in MS 
with grade level. Since developmental inferences were made from the 
data, it was crucial that performance levels increase with grade level. 

Table A presents the means and standard deviations for each ethnic 
group at each grade level. Table 5 presents the results of an analysis 
of variance testing the effects of grade level on MS performance. The 
data show orderly increases in MS performance with each grade level 
and within each ethnic group (there were no significant sex differ- 
ences in MS). Grade level had a significant effect on MS performance 
according to the analysis of variance. 

Hypothesis 3 predicted that reasoning level (RL) would increase 
as MC increased. This hypothesis was intended to relate the map test's 
reasoning measures to Piaget and Inhelder's (1967) stages of spatial 
reasoning (i.e., to MC) . Since RL was intended to be a deeper measure 
of reasoning level than MS, and since Piaget claims that spatial 
reasoning is a special case of general reasoning development, RL and 
MC measures should be in substantial agreement. 

Figure 1 shows the median RL for subjects whose maps were cate- 
gorized in each map-drawing category. RL increased with MC up to 
category 6, at which point there was a decline in RL. However, out of 
270 map protocols, there were only two placed in category 6 by the judges 
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TABLE A 

Means and Standard Deviations of Map Test Scores 
For Each Ethnic Group at Each Grade Level 



Ethnic 


Grade Level 


Group 


5th 


7 th 


9th 


Total 


Black X 
SD 


11.97 
(3.81) 


13. AO 
(3.37) 


15.17 
(3.51) 


13.51 
(3.57) 


White X 
SD 


1A.20 
(3.57) 


16.17 
(3.15) 


17.20 
(3.67) 


15.85 
(3.A7) 


Chinese X 
SD 


1A.57 
(3.07) 


17. AO 
(3. A3) 


17.63 
(3.39) 


16.53 
(3.30) 


Total X 
SD 


13.53 
(3.50) 


15.63 
(3.32) 


16.63 
(3.52) 


• 

4 
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the RL estimate for category 6 may therefore have been unreliable. 
Thus, the hypothesis was generally supported by the data, but 
further research into the unpredicted results is caflled for, 

Fixed-Sequence Hypothesis , Part one of the fixed-sequence 
hypothesis (Hypothesis 5) predicted that ethnic group is a signi- 
ficant influence on MS levels. This hypothesis followed from the 
vast literature on differences between means for Black and White 
subjects on standard tests of IQ and achievement (e.g., Jensen, 1968; 
Tyler, 1965), even when social class has been equated (Bruce, 19A0; 
McQueen & Browning, 1960). Table 5 presents the results of the 
analysis of variance for ethnic group effects on MS performance. 
As shown in Table 5, ethnic group was a significant (p <*01) influence 
on MS. The mean differences in MS were taken to support the part 
of the f ixed-sequence hypothesis which predicted differing rates 
of development for each ethnic group. 

The fixed-sequence hypothesis (Hypothesis 6) also predicted 
that f^espite differences in mean levels of achievement on the map- 
reading tasks, all ethnic groups would exhibit a fixed sequence of 
item performance. Table 6 presents the results of scalogram analyses 
of the hypothetical fixed-sequence of item performance. The first 
column of figures in Table 6 is based on the original ordering of 
the set of 25 items of the map test. The Coefficients of Reproduci- 
bility (CR = l~The number of scaling errors over the total number of 
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TARLR 5 

Analysis of Variance of Kffects of Ethnic Group 
and Grade Level on Map Test Scores (N = 270) 



Source 



df 



MS 



F 



Ethnic Group 2 

Grade Level 2 

EG X GL A 

Error 261 



225.34 
223". 21 
5.49 
11.90 



19.02** 
18.76** 
<1 



**p < .01 
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Coefficients 
as Origina 


of Rcprouucibill Ly for the 25 Items in 
lly Ordered and as Reordered According 
Pass/Fail Frequency in Each Group 


the Map Test 
to Their 






Original Order 


Reordered 


M>rR 


Ethnic Group: 








Black 


.64 


.82 


• 74 


Chinese 


.55 


• 83 


.77 


White 


.58 


.85 


.79 


Combined Groups 


J 


.82 


.76 
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responses) reported for the original ordering of the it^ms do not 
approach an adequate Guttman Scale for any of the ethnic groups 
tested; criterion is usually ,85 to <90 (Guttman, 1947; Edwards, 1957), 
That is to say, the hypothetical ordering of the items did not scale 
as a fixed sequence for any of the three ethnic groups. 

It should be noted that Guttman scaling techniques do not 
permit analysis based on an arbitrary or hypothetical ordering 
of a set of items (Wohlwill, I960), Part of the process of obtain- 
ing scalability estimates is an empirical reordering of items accord- 
ing to their difficulty. Therefore, the reported coefficients should 
be viewed as rough estimates of scalability. Estimates were produced 
by modifying Gagne^'s (1962) technique for analyzing hierarchical 
learning sets; that is, by doubling the number of errors that were 
computed according to Gagne^'s technique (Elashoff, 1969). Reproduci- 
bility estimates were produced in this manner for the original ordering 
of the 25-map-reading items for each ethnic group. 

The second column of figures in Table 6 shows the Coefficients 
of Reproducibility for each ethnic group for the set of 25 items 
in the map test as reordered by the Cornell ranking technique, i,e,, 
ranked according to difficulty and rearranged to produce the fewest 
scale errors (Edwards, 1957). 

The results of the scalogram analyses with item sets and subjects 
reordered separately within each ethnic group (Table 6) indicated that 
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the three ethnic groups did not differ in the extent to which their 
overall performance conformed to the fixed-sequence model. Coeffi- 
cients of Reproducibility were ,82, ,83, and .85 for Black, Chinese, 
and White subjects. Thus, the results of the analyses indicate 
partial but not complete conformity to the fixed-sequence model. 
There are several factors which affected the reproducibility coeffi- 
cients, however, which should be taken into account. 

The machine program (BMD05S) which performed the analysis 
rearranged the items according to the "Cornell Ranking Technique" 
(Guttman, 1947), which reorders the items and subjects so as to minimize 
•errors,' thus giving a somewhat inflated estimate of scalability 
(Edwards, 1957). It is also true that the Coefficient of Reproduci- 
bility is necessary but not sufficient evidence for scalability. 
Also needed are items which vary in the percent of respondents in 
a given category. That is, if all subjects answer all questions 
correctly, the set of questions is scalable but trivial. The 
parameter describing the extent to which .items vary is the Minimal 
Marginal Reproducibility (MMR) measure (see Table 6), which refers 
to the mean per cent of subjects whose responses were in the same 
category. With the present set of items, dichotomized as they were 
into 'correct' or 'incorrect' responses, the MMR was high, but an 
inspection of the items indicated that they did have a wide range 
of difficulty. To some extent the high MMR was a function of dicho- 
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tomlzlng the item responses into two- categories; therefore, the 
CR is probably not greatly distorted by the high MMR, 

Wohlwill (1960) and Kofsky (1966) have pointed out some of the 
conceptual and procedural difficulties in using scalogram analysis 
results to infer developmental sequences. Most troublesome are the 
assumption that cross-sectional, discrete data reflects development; 
also troublesome are the possible effects of varying instructions 
and materials from task to task, and the possibility that the scalogram 
model may not be appropriate for all — or even most — developmental 
processes. Recent papers by Leik and Matthews (1968) and TenHouten 
(1969) are promising steps toward the refinement of developmental 
scaling techniques . 

Because of the way the fixed-sequence hypothesis was stated, 
the results could still be interpreted as generally supporting it. 
The prediction that the three ethnic groups would not differ in the 
extent to which their item performance conformed to the fixed-sequence 
model was supported by the data. However, the items had to be reordered 
in order for the hypothesis to be supported - 

Although not anticipated in the original testing of the fixed- 
sequence hypothesis, it became of interest to assess the extent to which 
the reorderings of items differed among ethnic groups. Table 7 
presents the 25 map-test items as reordered according to their 
difficulty for each group. Because of ties in difficulty, all three 
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sequences reduced from 25 to 20 steps. An examination of the reorderings 
of the items yields little systematic information; in general, it 
appears that many of the items did increase in difficulty for all groups 
as hypothesized, but the sequence for each group contains some distinct 
differences. Twelve of the items appeared to be about as easy or as 
difficult as predicted, and about equally difficult for each ethnic 
group (that is, they appear about the same place in the sequence for 
each group). An additional 7 items were either easier or harder than 
predicted, but not differentially so for any group. The remaining 6 
items ranked in difficulty differentially depending upon the ethnicity 
of the group. Further study of the items appearing at different points 
in the sequences is planned. 

Implications for Research and Practice . A number of possible 
implications of the present study should be investigated in future 
studies. The possible effects of order of item presentation on 
performance, curriculum, sequences, and intelligence test organization 
are of particular interest. 

The set of 25 items in the map-reading test was administered in 
a standard order to all subjects. If difficulty of items reflects 
sequences of development, the order in which the items are presented 
should make little difference in the production of a fixed sequence 



32 

or sequences. Future studies should randomize the items throughout 
the instrument to test this hypothesis. It is true, however, that 
the initial ordering of tasks in the test did not yield a fixed 
sequence for any of the three ethnic groups tested in the study; 
thus, it is equally possible that the order of presentation had 
disruptive effects on performance levels for all groups and that 
empirically derived sequences may facilitate performance of 
children with various characteristics. 

If sequences of presentation are shown to affect performance 
of children in systematic ways, curriculums may be made more 
sensitive to such influences as are found to exist. The assumption 
implicitly made by curriculum planners and teachers is that all children 
acquire knowledge and skills in the same sequence; curriculums are 
constructed with this assumption built in. If the. results of the 
present study are replicated and extended to other areas of the school 
curriculum, basic assumptions about curriculum construction may have 
to be revised. 

Another possible implication of f ixed-sequence research is in 
the area of intelligence testing. Since IQ tests are generally con- 
structed on the basis of item performance by members of a white 
middle-class normalization group, IQ tests may not be allowing for 
systematic differences in specific developmental sequences among 
children from various backgrounds. In a typical IQ test situation. 
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the session is terminated when a subject misses all items at a 
given level. Studies are planned in which IQ tests items are given 
in random order, and all subjects respond to all questions. Results 
will be analyzed in ways similar to those used in the fixed-sequence 
study reported here. It may be that children from differing back- 
grounds respond to item sequences in systematic . but different ways. 
The notion of a culture-fair IQ test may take on new meaning in 
the light of these studies. 

Conclusion . What light, if any, has the present study shed on 
the problem of invariant sequences of cognitive development? The 
results appear to contradict the cognitive-developmental assumption 
that all children develop skills and concepts in the same fixed 
sequence. The present study found that each ethnic group's perfor- 
mance approached a fixed sequence, but that the sequence was relatively 
distinct for the three groups. Three 'fixed sequences' were found 
where a single sequence was predicted. It should be noted again 
that the sequence tested in this study was highly specific and 
based on a task analysis of a single school task. 

The bulk of the evidence is still on the side of invariant 
sequences, but a task-specific sequence, as in the present study, does 
show variation according to differing ethnic background. In terms 
of possible educational significance, more specific sequences may 
have greater promise for instructional planning than broad, developmental 
stages of cognitive development* 
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It should be noted that ethnic group is but one ID variable 
that may effect development of task--specif ic reasoning. Ethnic group 
per se will probably turn out to be at best an imperfect indicator 
of more relevant cognitive variables. When educational treatments 
are devised which take ID variables into account, it is on the basis of 
cognitive variables rather than ethnic group membership that treatments 
are likely to be assigned. 

No adequate explanation for the unique reorderings has been 
proffered or verified, but several possible explanations are currently 
being investigated. The language of the items and instructions, the 
nature of the stimulus materials, sex differences, and of course the 
unanticipated cognitive demands o^ the questions themselves, are all 
possible sources of variation in the order of acquisition. 

There are a number of possible reasons why the results do not 
completely support the f ixed-sequence hypothesis. It may be, for 
example, that the measures of reasoning development in the study were 
superficial and do not represent true developmental stages. The skills 
requisite to map drawing may better reflect differences in learning 
histories and educational experiences than basic stages in cognitive 
development. There is, however, inherent difficulty in the argument 
over whether or not a given set of measures of fixed sequences is basic 
enough. The argument can be carried on a^ absurdum , with the critic 
at each turn accusing the investigator of being superficial, while 
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the investigator accuses the critic pf holding out until the data are 
consistent with his theoretical position. 

In the present study, an attempt was made to infer a single 
developmental sequence by testing children at three age levels. Since 
the level of performance increased as age increased, it was assumed 
that development took place with age. The process of development was 
assumed to be reflected in the pattern of responses made by the 
subjects and in the relationships predicted among various scores. 
Although by no means unique to the present study, problems in measuring 
development deserve greater attention. 
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